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DNA POLYMERASES 

The present invention relates to archaeal DNA polymerase variants and their use in 
the amplification of DNA. 

Polymerase chain reaction (PCR) is a method whereby a sequence of DNA may be 
selectively amplified to produce a large sample that may be readily analyzed. A solution 
containing the DNA to be amplified, together with free bases, a polymerase enzyme and 
primers that bind to opposite ends of the two strands of the DNA segment to be replicated, is 
heated to break the bonds between the strands of DNA. When the solution cools, the primers 
bind to the separated strands and the polymerase builds a new strand by joining free bases to 
the primers thereby producing a new strand that is restricted solely to the desired segment. 
PCR enables billions of copies of a small piece of DNA to be produced in several hours. 

Heat stable polymerases are required for this process and one of the most commonly 
used is Taq DNA polymerase from Thermus aquaticus. However,- this enzyme does not 
possess a 3 '-5' exonuclease III function, commonly referred to as "proofreading activity". 
This function removes bases that are mismatched at the 3' end of a primer-template duplex. 
The inability of Taq DNA polymerase to carry out this function results in it being prone to 
base incorporation errors. 

Archaeal DNA polymerases are thermally stable and demonstrate proofreading 
activity. However, native archaeal DNA polymerases are inhibited by deoxyuracil. Archaeal 
DNA polymerases have a "read-ahead" function specifically for uracil. This template 



checking activity scans the template ahead of the replication fork for the presence of uracil 
and stalls polymerisation when uracil is encountered. Thus, the presence of deoxyuracil in 
DNA causes amplification to be stalled when using native archaeal DNA polymerases. This 
is a serious setback since the repetitive heating and cooling cycles of a DNA sample being 
amplified by PCR results in partial, thermally induced deamination of dCTP (a component 
incorporated into newly amplified DNA) to dUTP (which can be incorporated into DNA) and 
deamination of deoxycytidine in the DNA to deoxyuracil. This can result in the native 
archaeal DNA polymerases being unsuitable for PCRs, in particular those concerned with the 
prevention of "carry-over contamination" where PCR is carried out with dUTP rather than 
dTTP. 

It is an object of the present invention to provide modified archaeal DNA polymerases 
that do not have the disadvantage of being inhibited by deoxyuracil and are particularly 
useful in polymerase chain reactions. 

Accordingly, a first aspect of the present invention provides a variant archaeal DNA 
polymerase having a modified amino acid sequence of a wild-type amino acid sequence, the 
modified sequence being in the amino-terminal amino acids that comprise a uracil-binding 
pocket in the wild-type polymerase whereby the variant polymerase has reduced affinity for 
uracil compared to wild-type archaeal DNA polymerases. 

The present invention is based upon research (see the Examples) conducted by the 
inventors that has identified a uracil-binding pocket in archaeal DNA polymerases. They 



realised that this pocket may be altered to provide variant polymerases according to the 
invention that may be beneficially used as described herein. 



The variant archaeal DNA polymerase may be a modification of an archaeal family B 
DNA polymerase. For instance the variant may be derived from any one of the fourteen 
archaeal family B DNA polymerase shown in Figure 1. For instance the variant may be 
derived from the polymerases found in Pyrococcus furiosus (Pfu-Pol), Thermococcus 
gorgonarius (Tgo-Pol), Thermococcus litoralis (Tli-Pol), Thermococcus sp. 9°N-7 (9°N-7- 
Pol), Desulfurococcus strain Tok (DTok-Pol), Pyrobaculum islandicum (Pis-Pol), 
Archaeoglobus fulgidus (Afu-Pol), Sulfolobus acidocaldarius (Sac-Pol), Sulfurisphaera 
ohwakuensis (Soh-Pol), Sulfolobus solfataricus (Sso-Pol), Pyrodictium occultum (Poc-Pol) or 
Aeropyrum pernix (Ape-Pol). It will be appreciated that the variant could also be derived 
from any other archaeal family B DNA polymerase. 



It is preferred that the variant is derived from Pyrococcus furiosus (Pfu-Pol). Pfu-Pol 
used by the inventors has the following amino acid sequence: 



MAILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPYIYALLRDDSKIEEVKKITGERHGKIVRIVDVEK 

VEKKFLGKPITVWKLYLEHPQDVPTIREKVREHPAVVDIFEYDIPFAKRYLIDKGLIPMEGEEELKILAF 

DIETLYHEGEEFGKGPIIMISYADENEAKVITWKNIDLPYVEWSSEREMIKRFLRIIREKDPDIIVTYN 

GDSFDFPYLAKRAEKLGIKLTIGRDGSEPKMQRIGDMTAVEVKGRIHFDLYHVITRTINLPTYTLEAVYE 

AIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQPLWDVSRSSTG 

NLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWENIVYLDFRALYPS 1 1 ITHNVS 

PDTLNLEGCKNYD I APQVGHKFCKD IPGFI PS LLGHLLEERQKI KTKMKETQDP I EKI LLD YRQKAI KLL 

ANS FYGYYGYAKARVT^CKECAESVTAWGRKYIEIjVWKEIjEEKFGFKVXiYIDTDGLYATIPGGESEEIKKK 

ALEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKETQARVLET 

ILKHGDVEEAVRIVKEVIQKIJ^NYEIPPEKIxAIYEQI^ 

GYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVLPAVLRILEGFGYRKEDLRYQKTRQVGLTSWL 
NIKKS 

(SEQ ID NO.l) 



The Pfii Pol used by the inventors contains an extra A at position 2. This extra amino 
acid was incorporated because it improves protein expression without affecting the properties 
of the enzyme. The true wild type Pfu Pol begins MILDVDY. The sequence of the true wild 
type is: 

MILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPYIYALLRDDSKIEEVKKITGERHGKIVRIVDVEK 
VEKKFLGKP I TVWKL YLEHPQDVPT I REKVREHPAWD I FE YD I P FAKRYL I DKGL I PMEGEEELKI LAF 
DIETLYHEGEEFGKGPIIMISYADENEAKVITWKNIDLPYVEVVSSEREMIKRFLRIIREKDPDIIVTYN 
GDSFDFPYIi7UCRAEKLGIKLTIGRDGSEPKMQRIGDMTAVEVKGRIHFDLYHVITRTINIjPTYTIjEAVYE 
AIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLiVGQPLWDVSRSSTG 
NLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRES YTGGFVKEPEKGLWENIVYLDFRALYPS I I ITHNVS 
PDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLEERQKIKTKMKSTQDPIEKILLDYRQKAIKLL 
ANSFYGYYGYAKARWYCKECAESVTAWGRKYIELVWKELEEKFGFKVLYIDTDGLYATIPGGESEEIKKK 
AliEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKETQARVLiET 
ILKHGDVEEAVRIVKEVIQKLANYEIPPEKIAIYEQITRPLHEYKAIGPHVAVAKKLAAKGVKIKPGMVI 
GYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVIjPAVLRILEGFGYRKEDLiRYQKTRQVGLTSWL 
NIKKS 

(SEQ ID NO.2) 



Accordingly, preferred mutants according to the invention begin MAILDVDY or 
MLDVDY. 



The inventors have found (see Example 1) that a uracil-binding pocket of the wild 
type polymerase forms part of the ssDNA template binding cleft of the polymerase (the so 
call cleft T). Furthermore the uracil-binding pocket comprises amino acids from two 
conserved regions of the polymerases: Region A and Region B separated by an unconsented 
region. In the archaeal polymerase from Pyrococcus furiosus (Pfu-Pol) Region A is formed 
by the amino acids 1-40 and Region B by amino acids 78-130. Highly conserved residues in 
these two regions form the highly ordered uracil-binding pocket. Other archaea have similar 
regions A & B in their respective polymerases as illustrated in Figure 1 . Preferably, one or 
more of the amino acids in Regions A and/or B are altered to form the variant archaeal DNA 
polymerase. 



Figure 1 illustrates a sequence alignment of the N -terminal domains of various 
archaeal polymerases. In figure 1 amino acids designated (1) have 90% or greater identity, 
(2) indicates 80- 90% identity and (3) 60-80% identity. The two highly conserved regions 
that form the uracil binding pocket are: 

Region A, amino acids 1-40 in Pfii-Pol (and corresponding regions in the other 
polymerases); and 

Region B, amino acids 78-131 in Pfu-Pol (and corresponding regions in the other 
polymerases). 

It is preferred that the variant is formed by alteration of one of the amino acids block 
shaded (1, 2 or 3) in Figure 1 . For example the alteration may be in the motif: E - - I - 
F/Y- - -Y- -D. 

The alteration may consist of a substitution, deletion or addition. One of the 
invariant residues may be altered or other residues in Regions A and/or B that affect the 
conformation of the uracil-binding pocket. 

The inventors believe that residues 7, 36, 37, 90-97 and 112 - 119 in Pfu-Pol are 
particularly important for uracil binding. Preferably, at least one of these residues is altered 
to effect the conformation of the pocket and thereby reduce its uracil-binding ability. More 
preferably, the mutation in Pfu-Pol consists of a change in the amino acids Y7, Y37, V93, 
1114 or PI 15. More preferably, the change consists of Y7A, Y37A, V93Q, V93R, I114R, 
II 14Q or PI 15 A. A most preferred Pflx-Pol mutation is V93Q. 



Examples of preferred Pfu-Pol variants have the following amino acid sequences: 



(a) 



Pfu pol Y7A (Y8A) 



MAILDVDAITEEGKPVIRLFKKENGKFKIEHDRTFRPYIYALLRDDSKIEEVKKITGERHGKIVRIVDVEK 
VEKKFLGKPITVWKLYLEHPQDVPTIREKV^EHPAVVDIFEYDIPFAKRYLIDKGLIPMEGEEELKILAF 
DIETLYHEGEEFGKGPI IMISYADENEAKVI TWKNIDLPYVEVVSS EREM I KRFLRI IREKDPDI IVTYN 
GDS FDFP YLAKRAEKLG I KLT I GRDGSE PKMQR I GDMTAVE VKGR I HFDLYHVITRT INLPTYTLE AVYE 
AI FGKPKEKVYADE I AKAWESGENLERVAKYSMEDAKATYELGKEFLPME IQLSRLVGQPLWDVSRS STG 
NLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRES YTGGFVKEPEKGLWENIVYLDFRALYPS 1 1 ITHNVS 
PDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLEERQKIKTKMKETQDPIEKILLDYRQKAIKLL 
ANS FYGYYGYAKARWYCKECAES VTAWGRKYI ELWKELEEKFGFKVLYIDTDGLYATI PGGESEE I KKK 
ALEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKETQAIIVLET 
ILKHGDVEEAVRIVKEVIQKIiANYEIPPEKIjAIYEQITRPLHEYKAIGPHVAVAKKIiAAKGVKIKPGMVI 
GYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVLPAVLRILEGFGYRKEDLRYQKTRQVGLTSWL 
NIKKS 



MAILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPAIYALLRDDSKIEEVKKITGERHGKIVRIVDVEK 

VEKKFLGKPITVWKLYLEHPQDVPTIREKVREHPAVVDIFEYDIPFAKRYLIDKGLIPMEGEEELKILAF 

DIETLYHEGEEFGKGPI IMISYADENEAKVI TWKNIDLPYVEVVSS EREM I KRFLRI IREKDPDI IVTYN 

GDSFDFPYIjAiCRAEKLG I KLT I GRDGSE PKMQR I GDMTAVEVXGR I HFDLYHVITRT INLPTYTLE 

AI FGKPKEKVYADE I AKAWESGENLERVAKYSMEDAKATYELGKEFLPME IQLSRLVGQPLWDVS RS STG 

NLVEWFLLRKAYERNEVAPNKPS EE EYQRRLRES YTGGFVKEPEKGLWENIVYLDFRALYPS 1 1 ITHNVS 

PDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLEERQKIKTKMKETQDPIEKILLDYRQKAIKLL 

ANS FYGYYGYAKARWYCKECAESVTAWGRKYIELVWKELEEKFGFKVLYIDTDGLYATI PGGESEE I KKK 

ALEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVKRDWSEIAKETQARVLET 

I LKHGD VEE AVR I VKE VI QKLANYE I P PEKLAI YEQ I TRPLHE YKAI GPHVAVAKKLAAKGVKI KPGMVI 

GYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVLPAVLRILEGFGYRKEDLRYQKTRQVGLTSWL 

NIKKS 



(c) Pfu pol V93Q (V94Q) 

MAILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPYIYALLRDDSKIEEVKKITGERHGKIVRIVDVEK 
VEKKFLGKPITVWKLYLEHPQDQPTIREKVREHPAVVDIFEYDIPFAKRYLIDKGLIPMEGEEELKILAF 
D I ETLYHEGEEFGKGP I IMISYADENEAKVITWKNIDLPYVEVVSSEREMIKRFLRI IREKDPDI IVTYN 
GDSFDFPYLAKRAEKLGIKLTIGRDGSEPKMQRIGDMTAVBVKGRIHFDLYHVITRTINLPTYTLEAVYE 
AI FGKPKEKVYADE IAKAWESGENLERVAKYSMEDAKATYELGKEFLPME IQLSRLVGQPLWDVSRS STG 
NLVEWFLLRKAYERNEVAPNKPS EE EYQRRLRES YTGGFVKEPEKGLWENIVYLDFRALYPS 1 1 ITHNVS 
PDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLEERQKIKTKMKETQDPIEKILLDYRQKAIKLL 
ANS FYGYYGYAKARWYCKECAESVTAWGRKYIELVWKELEEKFGFKVLYIDTDGLYATIPGGESEEIKKK 
ALEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKETQARVLET 
ILKHGDV^EAVRIVKEVIQKLANYEIPPEKIJVIYEQITRPLHEYKAIGPHVAVAKKLAAKGVXIKPG^WI 
GYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVLPAVLRILEGFGYRKEDLRYQKTRQVGLTSWL 
NIKKS 



(SEQ ID NO.3) 



(b) 



Pfu pol Y37A (Y38A) 



(SEQ ID NO.4) 



(SEQ ID NO.5) 



(d) Pfu pol V93R (V94R) 



MAILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPYIYALLRDDSKIEEVKKITGERHGKIVRIVDVEK 

VEKKFLGKPITVWKLYLEHPQDRPTIREKVREHPAVVDIFEYDIPFAKRYLIDKGLIPMEGEEELKILAF 

DIETLYHEGEEFGKGPIIMISYATENEAKVITWKNIDLPYVEWSSEREMIKRFLRIIREKDPDIIVTYN 

GDSFDFPYLAKRAEKLGIKLTIGRDGSEPKMQRIGDMTAVEVKGRIHFDLYHVITRTINLPTYTLEAVYE 

AIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQPLWDVSRSSTG 

NLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWENIVYLDFRALYPS 1 1 ITHNVS 

PDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLEERQKIKTKMKETQDPIEKILLDYRQKAIKLL 

ANSFYGYYGYAKARWYCKECAESVTAWGRKYIELVWKELEEKFGFKVLYIDTDGLYATIPGGESEEIKKK 

ALEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKETQARVLET 

ILKHGDVEEAVRIVKEVIQKLANYEIPPEKIiAIYEQITRPLHEYKAIGPHVAVAKKLAAK^ 

GYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVLPAVLRILEGFGYRKEDLRYQKTRQVGLTSWL 

NIKKS 

(SEQ ID NO.6) 



(e) PfupolI114R(IH5R) 

MAILDVDAITEEGKPVIRLFKKENGKFKIEHDRTFRPYIYALLRDDSKIEEVKKITGERHGKIVRIVDVEK 

VE KKFLGKP I TVWKL YLEHPQDVPT I REKVREHP AWD I FEYDRPFAKRYL IDKGL I PMEGEEELKI LAF 

DIETLYHEGEEFGKGPIIMISYADENEAXVITWKNIDLPYVEVVSSEREMIKRFLRIIREKDPDIIVTYN 

GDSFDFPYLAKRAEKLGIKLTIGRDGSEPKMQRIGDMTAVEVKGRIHFDLYHVITRTINLPTYTLEAVYE 

AIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQPLWDVSRSSTG 

NLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRES YTGGFVKEPEKGLWENI VYLDFRALYPS 1 1 ITHNVS 

PDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLEERQKIKTKMKETQDPIEKILLDYRQKAIKLL 

ANSFYGYYGYAKARWYCKECAESVTAWGRKYIELVWKELEEKFGFKVLYIDTDGLYATIPGGESEEIKKK 

ALEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKETQARVIjET 

ILKHGDVEEAVRIVKEVIQKIiANYEIPPEKLAIYEQITRPLHEYKAIGPHVAVAKKLAAKGVKIKP 

GY I VLRGDGP I SNRAI LAEE YDPKKHKYDAE YY I ENQVLPAVLR I LEGFGYRKEDLRYQKTRQVGLTS WL 

NIKKS 

(SEQ ID NO.7) 



(f) Pfu pol I114Q (IH5Q) 

MAILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPAIYALLRDDSKIEEVKKITGERHGKIVRIVDVEK 

VE KKFLGKP I TVWKL YLEHPQDVPT IREKVREHPAVVDIFEYDQPFAKRYLIDKGL I PMEGEEELKI LAF 

DIETLYHEGEEFGKGPIIMISYADENEAKVITWKNIDLPYVEWSSEREMIKRFLRIIREKDPDIIVTYN 

GDSFDFPYLAKRAEKLGIKLTIGRDGSEPKMQRIGDMTAVEVKGRIHFDLYHVITRTINLPTYTLEAVYE 

AIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQPLWDVSRSSTG 

NLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRES YTGGFVKEPEKGLWENIVYLDFRALYPS 1 1 ITHNVS 

PDTLNLEGCKNYD I APQVGHKFCKD I PGF I P S LLGHLLEERQK I KTKMKETQDP I EK I LLD YRQKAI KLL 

ANSFYGYY'GYAKARVT^CKECAESVTAWGRKYIELVWKELEEKFGFKVLYIDTDGLYATIPGGESEEIKKK 

ALEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKETQARVLET 

ILKHGDVEEAVRIVKEVIQKIANYEIPPEKLAIYEQITRPLHEYKAIGPHVAVAK^ 

GYI VLRGDGP I SNRAI LAEE YDPKKHKYDAE YY I ENQVLPAVLR I LEGFGYRKEDLRYQKTRQVGLTS WL 
NIKKS 

(SEQ ID NO.8) 
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It will be appreciated that the preferred mutants listed above comprise an alanine (A) 
insertion at position 2 which is not found in the wild type. Accordingly such mutants may be 
designated Y8A, Y38A, V94Q, V94R, I115R and I115Q mutants of the MAILDVDY form 
of Pfu Pol and correspond to Y7A, Y37A, V93Q, V93R, I114R and I114Q mutants of the 
true wild-type (MILDVDY). Y7A, Y37A, V93Q, V93R, I114R and I114Q mutants of the 
true wild-type (MILDVDY) are also preferred mutants according to the invention. 



It will be appreciated that equivalent residues in other archaeal polymerases may be 
mutated (see figure 1). For instance Y7A, Y37A, V93Q, V93R, I114R, I114Q and P115A 
remain preferred mutants in Tgo-pol, DTok-pol and 9°N-7-pol. 



The present invention also provides nucleic acids encoding an archaeal DNA 
polymerase variant as defined above. 



Examples of preferred mutant genes have the following DNA sequences: 
(a) Pfu pol Y7A (Y8A) 

ATGGCTATCCTGGACGTTGACGCCATCACCGAAGAAGGTAAGCCGGTTATCCGTCTGTTCAAAAAAGAAAACGGT 
AAATTCAAAATCGAACACGACCGTACCTTCCGTCCGTACATCTACGCTCTGCTGCGTGACGACTCTAAAATCGAA 
GAAGTTAAAAAAATCACCGGTGAACGTCATGGAAAGATTGTGAGAATTGTTGATGTAGAGAAGGTTGAGAAAAAG 
TTTCTCGGCAAGCCTATTACCGTGTGGAAACTTTATTTGGAACATCCCCAAGATGTTCCCACTATTAGAGAAAAA 
GTTAGAGAACATCCAGCAGTTGTGGACATCTTCGAATACGATATTCCATTTGCAAAGAGATACCTCATCGACAAA 
GGCCTAATACCAATGGAGGGGGAAGAAGAGCTAAAGATTCTTGCCTTCGATATAGAAACCCTCTATCACGAAGGA 
GAAGAGTTTGGAAAAGGCCCAATTATAATGATTAGTTATGCAGATGAAAATGAAGCAAAGGTGATTACTTGGAAA 
AACATAGATCTTCCATACGTTGAGGTTGTATCAAGCGAGAGAGAGATGATAAAGAGATTTCTCAGGATTATCAGG 
GAGAAGGATCCTGACATTATAGTTACTTATAATGGAGACTCATTCGACTTCCCATATTTAGCGAAAAGGGCAGAA 
AAACTTGGGATTAAATTAACCATTGGAAGAGATGGAAGCGAGCCCAAGATGCAGAGAATAGGCGATATGACGGCT 
GTAGAAGTCAAGGGAAGAATACATTTCGACTTGTATCATGTAATAACAAGGACAATAAATCTCCCAACATACACA 
CTAGAGGCTGTATATGAAGCAATTTTTGGAAAGCCAAAGGAGAAGGTATACGCCGACGAGATAGCAAAAGCCTGG 
GAAAGTGGAGAGAACCTTGAGAGAGTTGCCAAATACTCGATGGAAGATGCAAAGGCAACTTATGAACTCGGGAAA 
GAATTCCTTCCAATGGAAATTCAGCTTTCAAGATTAGTTGGACAACCTTTATGGGATGTTTCAAGGTCAAGCACA 
GGGAACCTTGTAGAGTGGTTCTTACTTAGGAAAGCCTACGAAAGAAACGAAGTAGCTCCAAACAAGCCAAGTGAA 
GAGGAGTATCAAAGAAGGCTCAGGGAGAGCTACACAGGTGGATTCGTTAAAGAGCCAGT^AAAGGGGTTGTGGGAA 
AACATAGTATACCTAGATTTTAGAGCCCTATATCCCTCGATTATAATTACCCACAATGTTTCTCCCGATACTCTA 
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AATCTTGAGGGATGC7UVGAACTATGATATCGCTCCTCAAGTAGGCCACAAGTTCTGCAAGGACATCCCTGGTTTT 
ATACCAAGTCTCTTGGGACATTTGTTAGAGGAAAGACAAAAGATTAAGACAAAAATGAAGGAAACTCAAGATCCT 
ATAGAAAAAATACTCCTTGACTATAGACAAAAAGCGATAAAACTCTTAGCAAATTCTTTCTACGGATATTATGGC 
TATGC7^AAAGCAAGATGGTACTGTAAGGAGTGTGCTGAGAGCGTTACTGCCTGGGGAAGAAAGTACATCGAGTTA 
GTATGGAAGGAGCTCGAAGAAAAGTTTGGATTTAAAGTCCTCTACATTGACACTGATGGTCTCTATGCAACTATC 
CCAGGAGGAGAAAGTGAGGAAATAAAGAAAAAGGCTCTAGAATTTGTAAAATACATAAATTCAAAGCTCCCTGGA 
CTGCTAGAGCTTGAATATGAAGGGTTTTATAAGAGGGGATTCTTCGTTACGAAGAAGAGGTATGCAGTAATAGAT 
GAAGAAGGAAAAGTCATTACTCGTGGTTTAGAGATAGTTAGGAGAGATTGGAGTGAAATTGCAAAAGAAACTCAA 
GCTAGAGTTTTGGAGACAATACTAAAACACGGAGATGTTGAAGAAGCTGTGAGAATAGTAAAAGAAGTAATACAA 
AAGCTTGCCAATTATGAAATTCCACCAGAGAAGCTCGCAATATATGAGCAGATAACAAGACCATTACATGAGTAT 
AAGGCGATAGGTCCTCACGTAGCTGTTGCAAAGAAACTAGCTGCTT^AAGGAGTTAAAATAAAGCCAGGAATGGTA 
ATTGGATACATAGTACTTAGAGGCGATGGTCCAATTAGCAATAGGGCAATTCTAGCTGAGGAATACGATCCCAAA 
AAGCACAAGTATGACGCAGAATATTACATTGAGAACCAGGTTCTTCCAGCGGTACTTAGGATATTGGAGGGATTT 
GGATACAGAAAGGAAGACCTCAGATACCAAAAGACAAGACAAGTCGGCCTAACTTCCTGGCTTAACATTAAAAAA 
TCC 



ATGGCTATCCTGGACGTTGACTACATCACCGAAGAAGGTAAGCCGGTTATCCGTCTGTTCAAAAAAGAAAACGGT 
AAATTCAAAATCGAACACGACCGTACCTTCCGTCCGTACATCTACGCTCTGCTGCGTGACGACTCTAAAATCGAA 
GAAGTTAAAAAAATCACCGGTGAACGTCATGGAAAGATTGTGAGAATTGTTGATGTAGAGAAGGTTGAGAAAAAG 
TTTCTCGGCAAGCCTATTACCGTGTGGAAACTTTATTTGGAACATCCCCAAGAT CAG CCCACTATTAGAGAAAAA 
GTTAGAGAACATCCAGCAGTTGTGGACATCTTCGAATACGATATTCCATTTGC7VAAGAGATACCTCATCGACAAA 
GGCCTAATACCAATGGAGGGGGAAGAAGAGCTAAAGATTCTTGCCTTCGATATAGAAACCCTCTATCACGAAGGA 
GAAGAGTTTGGAAAAGGCCCAATTATAATGATTAGTTATGCAGATGAAAATGAAGCAAAGGTGATTACTTGGAAA 
AACATAGATCTTCCATACGTTGAGGTTGTATCAAGCGAGAGAGAGATGATAAAGAGATTTCTCAGGATTATCAGG 
GAGAAGGATCCTGACATTATAGTTACTTATAATGGAGACTCATTCGACTTCCCATATTTAGCGAAAAGGGCAGAA 
AAACTTGGGATTAAATTAACCATTGGAAGAGATGGAAGCGAGCCCAAGATGCAGAGAATAGGCGATATGACGGCT 
GTAGAAGTCAAGGGAAGAATACATTTCGACTTGTATCATGTAATAACAAGGACAATAAATCTCCCAACATACACA 
CTAGAGGCTGTATATGAAGCAATTTTTGGAAAGCCAAAGGAGAAGGTATACGCCGACGAGATAGCAAAAGCCTGG 
GAAAGTGGAGAGAACCTTGAGAGAGTTGCCAAATACTCGATGGAAGATGCAAAGGCAACTTATGAACTCGGGAAA 
GAATTCCTTCCAATGGAAATTCAGCTTTCAAGATTAGTTGGACAACCTTTATGGGATGTTTCAAGGTCAAGCACA 
GGGAACCTTGTAGAGTGGTTCTTACTTAGGAAAGCCTACGAAAGAAACGAAGTAGCTCCAAACAAGCCAAGTGAA 
GAGGAGTATCAAAGAAGGCTCAGGGAGAGCTACACAGGTGGATTCGTTAAAGAGCCAGAAAAGGGGTTGTGGGAA 
AACATAGTATAC CTAGATTTT AGAGC CCTATATC CCTCGATTATAATTACC CACAATGTTTCTCC CGATACTCTA 
AATCTTGAGGGATGCAAGAACTATGATATCGCTCCTCAAGTAGGCCACAAGTTCTGCAAGGACATCCCTGGTTTT 
ATACCAAGTCTCTTGGGACATTTGTTAGAGGAAAGACAAAAGATTAAGACAAAAATGAAGG7UVACTCAAGATCCT 
ATAGAAAAAATACTCCTTGACTATAGACAAAAAGCGATAAAACTCTTAGCAAATTCTTTCTACGGATATTATGGC 
TATGCAAAAGCAAGATGGTACTGTAAGGAGTGTGCTGAGAGCGTTACTGCCTGGGGAAGAAAGTACATCGAGTTA 
GTATGGAAGGAGCTCGAAGAAAAGTTTGGATTTAAAGTCCTCTACATTGACACTGATGGTCTCTATGCAACTATC 
CCAGGAGGAGAAAGTGAGGAAATAAAGAAAAAGGCTCTAGAATTTGTAAAATACATAAATTCAAAGCTCCCTGGA 
CTGCTAGAGCTTGAATATGAAGGGTTTTATAAGAGGGGATTCTTCGTTACGAAGAAGAGGTATGCAGTAATAGAT 
GAAGAAGGAAAAGTCATTACTCGTGGTTTAGAGATAGTTAGGAGAGATTGGAGTGAAATTGCAAAAGAAACTCAA 
GCTAGAGTTTTGGAGACAATACT7UUVACACGGAGATGTTGAAGAAGCTGTGAGAATAGTAAAAGAAGTAATACAA 
AAGCTTGCCAATTATGAAATTCCACCAGAGAAGCTCGCAATATATGAGCAGATAACAAGACCATTACATGAGTAT 
AAGGCGATAGGTCCTCACGTAGCTGTTGCAAAGAAACTAGCTGCTAAAGGAGTTAAAATAAAGCCAGGAATGGTA 
ATTGGATACATAGTACTTAGAGGCGATGGTCCAATTAGCAATAGGGCAATTCTAGCTGAGGAATACGATCCCAAA 
AAGCACAAGTATGACGCAGAATATTACATTGAGAACCAGGTTCTTCCAGCGGTACTTAGGATATTGGAGGGATTT 
GGATACAGAAAGGAAGAC CTC AGAT AC CAAAAGACAAGACAAGTCGGCCTAACTTCCTGGCTTAACATTAAAAAA 
TCC 



(SEQ ID NO.9) 



(b) 



Pfu pol V93Q (V94Q) 



(SEQ ID NO JO) 



m 
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(c) Pfu Pol P115A (P116A) 

ATGGCTATCCTGGACGTTGACTACATCACCGAAGAAGGTAAGCCGGTTATCCGTCTGTTCAAAAAAGAAAACGGT 
AAATTCAAAATCGAACACGACCGTACCTTCCGTCCGTACATCTACGCTCTGCTGCGTGACGACTCTAAAATCGAA 
GAAGTTAAAAAAATCACCGGTGAACGTCATGGAAAGATTGTGAGAATTGTTGATGTAGAGAAGGTTGAGAAAAAG 
TTTCTCGGCAAGCCTATTACCGTGTGGAAACTTTATTTGGAACATCCCCAAGATGTTCCCACTATTAGAGAAAAA 
GTTAGAGAACATCCAGCAGTTGTGGACATCTTCGAATACGATATTTTTGCAAAGAGATACCTCATCGACAAAGGC 
CTAATACCAATGGAGGGGGAAGAAGAGCTAAAGATTCTTGCCTTCGATATAGAAACCCTCTATCACGAAGGAGAA 
GAGTTTGGAAAAGGCCCAATTATAATGATTAGTTATGCAGATGAAAATGAAGCAAAGGTGATTACTTGGAAAAAC 
ATAGATCTTCCATACGTTGAGGTTGTATCAAGCGAGAGAGAGATGATAAAGAGATTTCTCAGGATTATCAGGGAG 
AAGGATCCTGACATTATAGTTACTTATAATGGAGACTCATTCGACTTCCCATATTTAGCGAAAAGGGCAGAAAAA 
CTTGGGATTAAATTAACCATTGGAAGAGATGGAAGCGAGCCCT^AGATGCAGAGAATAGGCGATATGACGGCTGTA 
GAAGTCAAGGGAAGAATACATTTCGACTTGTATCATGTAATAACAAGGACAATAAATCTCCCAACATACACACTA 
GAGGCTGTATATGAAGCAATTTTTGGAAAGCCAAAGGAGAAGGTATACGCCGACGAGATAGCAAAAGCCTGGGAA 
AGTGGAGAGAACCTTGAGAGAGTTGCCAAATACTCGATGGAAGATGCAAAGGCAACTTATGJ\ACTCGGGAAAGAA 
TTCCTTCCAATGGAAATTCAGCTTTCAAGATTAGTTGGACAACCTTTATGGGATGTTTCAAGGTCAAGCACAGGG 
AACCTTGTAGAGTGGTTCTTACTTAGGAAAGCCTACGAAAGAAACGAAGTAGCTCCAAACAAGCCAAGTGAAGAG 
GAGTATCAAAGAAGGCTCAGGGAGAGCTACACAGGTGGATTCGTTAAAGAGCCAGAAAAGGGGTTGTGGGAAAAC 
ATAGTATACCTAGATTTTAGAGCCCTATATCCCTCGATTATAATTACCCACAATGTTTCTCCCGATACTCTAAAT 
CTTGAGGGATGCAAGAACTATGATATCGCTCCTCAAGTAGGCCACAAGTTCTGCAAGGACATCCCTGGTTTTATA 
CCAAGTCTCTTGGGACATTTGTTAGAGGAAAGACAAAAGATTAAGACAAAAATGAAGGAAACTCAAGATCCTATA 
GAAAAAATACTCCTTGACTATAGACAAAAAGCGATAAAACTCTTAGCAAATTCTTTCTACGGATATTATGGCTAT 
GCAAAAGCAAGATGGTACTGTT^AGGAGTGTGCTGAGAGCGTTACTGCCTGGGGAAGAAAGTACATCGAGTTAGTA 
TGGAAGGAGCTCGAAGAAAAGTTTGGATTTAAAGTCCTCTACATTGACACTGATGGTCTCTATGCAACTATCCCA 
GGAGGAGAAAGTGAGGAAATAAAGAAAAAGGCTCTAGAATTTGTAAAATACATAAATTCAAAGCTCCCTGGACTG 
CTAGAGCTTGAATATGAAGGGTTTTATAAGAGGGGATTCTTCGTTACGAAGAAGAGGTATGCAGTAATAGATGAA 
GAAGGAAAAGTCATTACTCGTGGTTTAGAGATAGTTAGGAGAGATTGGAGTGAAATTGCAAAAGAAACTCAAGCT 
AGAGTTTTGGAGACAATACTAAAACACGGAGATGTTGAAGAAGCTGTGAGAATAGTAAAAGAAGTAATACAAAAG 
CTTGCCAATTATGAAATTCCACCAGAGAAGCTCGCAATATATGAGCAGATAACAAGACCATTACATGAGTATAAG 
GCGATAGGTCCTCACGTAGCTGTTGCAAAGAAACTAGCTGCTAAAGGAGTTAAAATAAAGCCAGGAATGGTAATT 
GGATACATAGTACTTAGAGGCGATGGTCCAATTAGCAATAGGGCAATTCTAGCTGAGGAATACGATCCCAAAAAG 
CACAAGTATGACGCAGAATATTACATTGAGAACCAGGTTCTTCCAGCGGTACTTAGGATATTGGAGGGATTTGGA 
TACAGAAAGGAAGACCTCAGATACCAAAAGACAAGACAAGTCGGCCTAACTTCCTGGCTTAACATTAAAAAATCC 

(SEQ ID NO. 11) 



It will be appreciated that preferred mutants encoded by the DNA sequences listed 
above comprise a codon for an alanine (A) inserted at position 2 that is not found in the wild 
type. Accordingly such mutants may be designated Y8A, Y38A and P116A mutants of the 
MAILDVDY form of Pfu Pol and correspond to Y7A, Y37A and PI 15A mutants of the true 
wild-type (MILDVDY). 



It will also be appreciated that DNA sequences for other preferred mutants will be 
readily apparent given the abovementioned amino acid sequences. Accordingly DNA 
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molecules encoding Y7A, Y37A, V93A, II 14R, II 14Q and PI 15A mutants of the true wild- 
type (MILDVDY) are also preferred nucleic acids according to the invention. 

The variant polymerases as defined above are particularly useful for PCRs since they 
are thermally stable, have proofreading ability but are not stalled by the presence of dUTP. 

Accordingly, a further aspect of the present invention provides a kit useful for 
polymerase chain reactions comprising DNA to be amplified, free bases, primers and a 
variant archaeal DNA polymerase having a modified amino acid sequence of a wild-type 
amino acid sequence, the modified sequence being in the amino-terminal amino acids that 
comprise a uracil binding pocket in the wild-type polymerase whereby the variant polymerase 
has reduced affinity for uracil compared to the wild-type polymerase. 

The present invention further provides a method of amplifying DNA comprising the 
steps of (i) denaturing a double strand of DNA by heating a solution containing the DNA, 
free oligonucleotides, primers and a variant archaeal DNA polymerase having a modified 
amino acid sequence of a wild-type amino acid sequence, the modified sequence being in the 
amino-terminal amino acids that comprise a uracil binding pocket in the wild type 
polymerase whereby the variant polymerase has reduced affinity for uracil compared to the 
wild-type polymerase; (ii) reducing the temperature of the solution to effect annealing of the 
primer and the DNA and (iii) heating the solution to effect extension of DNA by the variant 
polymerase. 
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The ability of the variant DNA polymerase to amplify DNA in the presence of dUTP 
results in it being particularly suitable for PCR that uses dUTP rather than dTTP, for example 
in the prevention of contamination of samples. 

A preferred protocol for carrying out PCR utlising variant polymerases according to 
the method of the present invention is as follows: PCR may carried out under the following 
conditions: lOOfxl volume, 20 mM Tris-HCl pH 8.8, 10 mM KC1, 10 mM (NH 4 ) 2 S0 4 , 2 mM 
MgS0 4 , 0.1% Triton X100, 100 |ag/ml BSA, 250 jiM each of dATP, dGTP, dCTP and either 
250 |xM dTTP or 250 fiM dUTP, 2.5 units DNA polymerase overlayed with 40jil of mineral 
oil (1 unit of polymerase is defined as amount of enzyme that incorporates 10 nmol of dATP 
into acid-precipitable material using an activated calf-thymus DNA-based assay (4) in 30 min 
at 72°C). 5 ng of template DNA to be amplified is used and the concentration of the forward 
and reverse primers (each 18 bases in length) is 0.3 jaM. Each PCR consisted of 30 cycles of 
1 min at 95°C, 2 min at 52°C and 4.5 min at 72°C. 

The present invention will now be further illustrated by means of the following 
Examples in which Example 1 investigates the uracil-binding pocket of Archaea 
polymerases, in particular the hyperthermophilic archaea, Pyrococcus furiosus (Pfu-pol) and 
Example 2 investigates the effect of mutagenesis of residues in and around the uracil-binding 
pocket of archaeal DNA polymerases, and with reference to the accompanying drawings in 
which:- 
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Figure 1 illustrates an amino acid sequence alignment of archaeal family B DNA 
polymerases, corresponding to residues 1-130 of Pyrococcus furiosus polymerase. 
Candidates were identified using a WUBLAST search (European Bioinformatics Institute, 
http://www.ebi.ac.uk/ebi home.html) for homologues of Pfu-Pol. An additional ENTREZ 
search of the SWISSPROT database for family B DNA polymerises was performed 
[Genbank ( http://www.ncbi.nlm.nih.govy r ) l. Sequence alignments were generated using 
ClustalX (version 1.81) [J. D. Thompson et al, Nucl Acids. Res 24, 4876 (1997]. The 
organisms and the DNA polymerase sequence accession numbers were: Pyrococcus fuiosus 
(Pfu) (P80061), Thermococcus gorgonarius (Tgo) (pdb 1D5A), Pyrococcus kodakaraensis 
(PKOD) (gi/1 3399597), Desulfurococcus strain Tok (DTok), Thermococcus sp. 9°N-7 (9°N- 
7) (Q56366), Thermococcus litoralis (Tli) (AAA72101.1), Methanococcus voltae (Mvo) 
(P52025), Pyrobaculum islandicum (Pis) (AAF27815.1), Archaeoglobus fulgidus (Agu) 
(029753), Cenarchaeaum symbiosum (Csy) (AAC62712.1), Sulfolobus acidocaldarius (Sac) 
(P95690), Sulfurisphaera ohwakuensis (Soh) (BAA23994.1), Sulfolobus solfataricus (Sso) 
(P26811), Pyrodictium occultum (Poc) (BAA07579.1) and Aeropyrum pernix (Ape) 
(NPJ 48473.1); 

Figure 2A: The template-binding cleft T (21) of Tgo-Pol showing the presence of a 
pocket. B: The N-terminal domain of Tgo-Pol with amino acids that form the pocket shown 
in space-fill: Y7; P36/Y37; amino acids 90-97 (22); amino acids 112-116 (23). ot-Helices are 
shown (24) and P-sheets (25). C: Amino acid sequences of the N-terminal domains of Tgo- 
Pol (upper sequence (SEQ ID NO. 12)) and RB69-Pol (lower sequence (SEQ ID NO. 13)). 
Amino acids that form the pocket in Tgo-Pol (and the corresponding residues in RB69-Pol) 
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are underlined and correspond to the amino acids identified in panels B and E. Cylinders 
represent ct-helices and arrows P-sheets. The amino acid sequences have minimal homology 
and have been aligned using structural homology. D: Structural alignment of the N-terminal 
domains of Tgo-Pol and RB69-Pol. The insert in Tgo-Pol is shown (26). This structural 
superimposition was used to generate the amino acid alignment shown in C. E: The N- 
terminal domain of RB69-Pol. Space-filled amino acids (V8 Q10; residues 65-72 (27); 
residues 84-89 (28); P35/S36 lie substantially behind residues 84-89) correspond to those in 
Tgo-Pol (shown in panel B) which form the pocket. Both V8 and Q10 are near the position of 
the Tgo-Pol Y7. a-Helices are shown (24) and p-sheets (25). Images/structural homology 
models were produced using Swiss-Model [N. Guex, M. C. Peitsch, Electrophoresis 18, 2714 
(1997)] (http://www.expasv.ch/spdbv/) . POV-Ray [C. Cason, POV-Ray for Windows, 
version 3.1g (1999)] (http://www.povray.org) and Rasmol [R. Sayle, J. F. Milner- White, 
Trends Biochem. Sci. 20, 374 (1995)]. 

Figure 3: Modelling pyrimidines into the N-terminal domain pocket of Tgo-Pol using 
Web Lab Viewer Pro [Molecular Simulations Inc., Web Lab Viewer Pro (version 4.0) 
(2000)] (http://www.msi.com). Hatched lines represent hydrogen bonds; except steric 
clashes are identified (31). Uracil formed four enzyme-base hydrogen bonds and no clashes. 
Cytosine and thymine resulted in fewer hydrogen bonds and/or clashes. 

Figure 4 A and B: Primer extension reactions using a 5'- 32 P labeled 24-mer primer 
(5'-GGGGATCCTCTAGAGTCGACCTGC-3' (SEQ ID NO.14)) (2.5nM) 
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annealed to 44-mer template (5'- 

GGAGACAAGCTTG(UmATGCCT GCAGGTCGACTCTAGCGGCTAAAA-3 , (SEP ID 
NO. 15)) (5nM). The primer hybridises with the underlined section of the template to place a 
uracil (thymine in controls) seven bases from the primer-template junction (H.H. Hogrefe, C. 
J. Hansen, B. R. Scott, K. B. Nielson, Proc. Natl Acad. Sci. U.S.A., 99, 596 (2002)). In the 
presence of the four dNTPs all Pfu-Pol variants were able to fully copy the control template, 
lacking uracil, converting the labeled 24-mer primer into a product 44 bases long (A). With 
uracil in the template the wild-type enzyme (WT) stalled polymerisation (H.H. Hogrefe, C. J. 
Hansen, B. R. Scott, K. B. Nielson, Proc. Natl. Acad. Sci. U.S.A., 99, 596 (2002)) giving a 
truncated product (B). The arrow (41) indicates the major pause site, shown (using standards, 
not illustrated) to occur four base before uracil. With Y7A, V93Q and P115A some full- 
length product was observed (amounts: V93Q > P115A > Y7A) indicating read-through of 
the template-strand uracil (panel B). No full-length product was seen with Y37A, Y37F and 
PI 15F (some material partially extended past the "uracil induced pause" was seen with Y7A, 
Y37A and Y37F. The first and second lanes contain standard 24-mer and 44-mer. C and D: 
PCR reactions. In control PCR with the four normal dNTPs (dATP, dGTP, dCTP and dTTP) 
all the Pfu-Pol variants (and Taq-Pol) gave the anticipated PCR product, ~ 2 kbases in length 
(C). When dUTP replaced dTTP, Taq-Pol produced a 2 kbase fragment (D). With most Pfu- 
Pol variants (WT, Y37A, Y37F and P115F) no PCR product was seen (D). Three Pfu-Pol 
mutant's produced a PCR product (amounts: V93Q > PI 15A > Y7A) (2 x and 4 x loadings, 
respectively, were used for visualisation with P115A and Y7A). All PCR reactions were 
carried out under identical conditions with no optimisation; this probably accounts for the 



# 



16 

lower weight contaminant seen in some lanes. Markers (important sizes indicated) were from 
Promega. 

Figure 5: Polymerase chain reaction (PCR) with Thermus aquaticus polymerase 
(Taq-Pol) and Pyrococcus furiosus polymerase (Pfti-Pol) (wild type and two mutants, V93Q 
and V93R). A DNA fragment (~ 1 kilobase in length) from the plasmid pET-17b(Pfii-Pol) 
was amplified. Conditions 20 mM Tris-HCl, pH 8.8, 10 mM KC1, 10 mM (NH4) 2 S0 4 , 2 mM 
MgSC>4, 0.1 % (v/v) Triton XI 00 and 0.1 mg/ml bovine serum albumin. Each reaction 
contained 250 j^M each dATP, dGTP and dCTP. Reactions 1-4 contained 250 ^iM TTP; 
reactions 5-8 250 |jM dUTP. 2.5 units of the polymerase were used. Cycle 1x10 minutes 94 
°C; 30 x 1 minute 94 °C/2 minutes 42°C/1 minute 72 °C; 1 x 10 minutes 72 °C. The Pfu-Pol 
mutants V93Q and V93R give higher yield of PCR product (lanes 3,4) than wild type Pfu-Pol 
(lane 2) when TTP is used. Wild type Pfu-Pol does not give a PCR product with dUTP (lane 
6), whereas Pfu-Pol mutants V93Q and V93R give a product (lanes 7,8) [lane 1= Taq-Pol 
with TTP; lane 2= Wild type Pfu-pol with TTP; lane 3= Pfu-Pol V93Q with TTP; lane 4= 
Pfu-Pol V93R with TTP; lane 5= Taq-Pol with dUTP; lane 6= Wild type Pfu-pol with dUTP; 
lane 7= Pfu-Pol V93Q with dUTP; lane 8= Pfu-Pol V93R with dUTP] [M= lOObp ladder; 
Mb= 1Kb ladder]. 



* 
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Example 1 

Crystal structures are known for five family B DNA polymerases; one viral, the 
remaining four archaeal. The first structure to be solved was for the bacteriophage RB69 
polymerase (RB69-Pol) (J. Wang et al, Cell 89, 1087 (1997)); a structure with primer- 
template has also been determined (M. C. Franklin, J. J. Wang, T. A. Steitz, Cell 105, 657 
(2001)). More recently, the structure of an archaeal family B DNA polymerase, from the 
hyperthermophilic archaeon Thermococcus gorgonarius (Tgo-Pol), has been reported (K. P. 
Hopfner et al. 9 Structure 7, 1189 (1999)). Three other archaeal polymerase structures, 
Desulfurococcus strain Tok (DTok-Pol) (Y. Zhao et al, Structure 7, 1189 (1999)), 
Thermococcus sp. 9°N-7 (9°N-7-Pol) (A. C. Rodgriguez, H-W. Park, C. Mao, L. S. Beese, J. 
Mol. Biol. 299, 447 (200)) and Pyrococcus kodakaraensis KOD1 (KODl-Pol) (H. Hashimoto 
et al., J. Mol. Biol 306, 469 (2001)), were subsequently solved. Only apo-enzyme structures 
are known for the archaea. All five family B polymerases contain five distinct domains, the 
N-terminal domain, the exonuclease or 'editing 5 domain and three polymerase active site 
domains. The folding of the five domains forms three distinct clefts extending from a central 
hole. Two (named clefts D and T) are oriented approximately 1 80° relative to each other, on 
either side of the central hole. The structure of RB69-pol containing a primer-template 
demonstrates that cleft D binds double stranded primer-template and cleft T binds single- 
stranded template (M. C. Franklin et al supra). The three polymerase domains forms cleft D, 
whereas cleft T is formed by the exonuclease domain and the N-terminal domain. The third 
cleft is perpendicular to the other two and represents the 3 '-5' exonuclease/editing cleft. 
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Examination of cleft T in the case of Tgo-Pol (the other archaeal polymerases give 
similar results) revealed the presence of a pocket located on a surface exposed face towards 
the outer edge of the polymerase (figure 2A). The location of this pocket, in the template 
strand binding region, approximately four bases from the primer-template junction makes it a 
clear candidate for uracil recognition. The pocket is comprised of amino acids that are solely 
present in the N-terminal domain. Amino acids from four regions of this domain, which are 
close together in space, are used to assemble the putative uracil binding pocket. These amino 
acids are illustrated in figure 2B (a structural representation of the N-terminal domain) and 
underlined in figure 2C (the amino acid sequence of the N-terminal domain). The function of 
Y7, which sits at the entrance of the pocket, is obscure. It may form a lid, which closes 
following binding, ensuring trapping (and hence high affinity) of uracil. The base of the 
pocket (clearly visible as P36/Y37 in figure 2B) is formed by Y37, oriented by P36 at the 
beginning of a 0 sheet and supported by K84 "underneath" Y37. One side of the pocket is 
formed by amino acids 90-97, present in a cc-helix. A proline (P94) bends the oc-helix, 
forming a curved wall (figure 2B). The other side comprises residues 110-116; amino acids 
present in a loop region (110-1 14) or at the beginning of a second a-helix (1 15-1 16). This a- 
helix commences with amino acids PI 15 and F116. The proline appears to have a critical 
role, ensuring that PI 15 and F116 are able to form part of the pocket's curved wall (figure 
2B). 

The viral polymerase from RB-69 shows 61% amino acid identity to bacteriophage 
T4-Pol. Additionally, the N-terminal of RB69-Pol is identical to a structure for an N-terminal 
fragment of T4-Pol (J. Wang, P. Yu, T. C. Lin, W. H. Konigsberg, T.A. Steitz, Biochemistry 
35, 8110 (1996)). Previously, it has been demonstrated that T4-Pol did not stall 
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polymerisation in response to template-strand uracil (M. A. Greagg et al. 9 Proc Natl. Acad. 
Sci. U.S.A. 96, (1999)); inability to recognize uracil would also be expected for RB69-Pol. 
This contrasts with Tgo-Pol which stalls polymerization when template strand uracil is 
encountered (M. A. Greagg et aL, Supra), Tgo-Pol is also inhibited by uracil containing DNA 
(R. S. Lasken, D. M. Schuster, A. Rashtchian, J. Biol. Chem. 271, 17692 (1996)); a feature 
not seen with viral enzymes. Therefore, if the pocket seen with Tgo-Pol is used for uracil 
detection, it should be absent for the viral enzymes. The N-terminal domains of Tgo-Pol and 
RB69-Pol show considerable structural homology (figure 2D); near perfect alignment of 
secondary structural elements is observed, despite the almost complete lack of amino acid 
sequence homology (figure 2C). The only significant difference is the presence of an amino 
acid insert (shown in figure 2D) in the archaeal enzyme. However, detailed comparisons 
demonstrate differences in the two polymerases (figure 2E). With Tgo-Pol, Y7 may act as a 
lid and Y36 forms the base of the pocket; in RB-69 these residues are replaced with 
valine/glutamine (both these amino acids in RB69-Pol are near Y7 of Tgo-Pol and it is not 
clear which is the exact replacement) and serine respectively. Similarly Tgo-Pol uses a 
proline-containing kinked a-helix (residues 90-97) to form one wall of the pocket. In RB69- 
Pol the corresponding a-helix (residues 65-74) lacks the proline; therefore, the helix is 
straight and does not form a good wall. However, a key feature involves PI 15 and F116, 
which form part of one of the walls of the pocket with Tgo-Pol. In the case of RB69 the 
proline is missing and this results in the corresponding phenylalanine (F88) falling into and 
completely filling the pocket. As shown in figure IE this means that the viral enzyme lacks a 
uracil binding pocket (P35/S36 are substantially obscured). The differences between the viral 
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and archaeal enzymes, based on subtle changes to a few amino acids, provide compelling 
evidence that the N-terminal pocket is responsible for uracil detection. 

It was possible to model uracil into the pocket of Tgo-Pol (figure 3A). The most 
favourable orientation produces four hydrogen bonds between the protein and uracil. In all 
cases the protein uses the peptide backbone for hydrogen bond formation. The interactions 
comprise: 1114 (peptide -NH) to uracil C2 =0 group; El 1 1 (peptide =0) to uracil N3H; Y37 
(peptide =0) to uracil N3H and (peptide -NH) to uracil C4=0 (figure 3A). Cytosine 
superimposed at the same position as uracil forms only one hydrogen bond (1114 (peptide - 
NH) to uracil C2 =0) and the 4-NH2 group clashes with the main chain atoms of Y37 
(peptide =0 and -NH) (figure 3B). This clash could be relieved by repositioning the cytosine, 
but only at the expense of the one H-bond, resulting in no interactions between the base and 
the protein. Thymine superimposed at the uracil position could form most of the protein-base 
hydrogen bonds (interactions were identical to uracil except the Y37 (peptide -NH) to 
uracil/thymine N3H hydrogen bond was not formed). Critically the C5 -CH3 group showed a 
severe steric clash with the edges of the cyclic P36 side chain and the ring of Fl 16, thereby 
preventing binding of the base within the pocket (figure 3C). Thus the pocket is highly 
specific for binding uracil and able to discriminate against the 'normal' DNA pyrimidines. 

An amino acid sequence alignment has been carried out for the N-terminal domains of 
fourteen archaeal family B DNA polymerases (figure 1); eight were from the crenarchaea and 
six from the euryarchaea. Twelve of the polymerases were either thermophiles or 
hyperthermophiles, one was mesophilic (Methanococcus voltae (Mvo)) and one was 
psychrophilic (Cenarchaeaum symbiosum (Csy)). Two highly conserved regions (A and B) 
which contain most of the amino acids that from the uracil binding pocket, are seen. Many of 
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the amino acids comprising the uracil binding pocket (figure 2B and 2C) are highly 
conserved; especially uracil-contacting residues (figure 3 A). Thus P36, Y37, El 11 and 1114 
show 100 % identity. The possible pocket lid, Y7, also shows 100 % conservation. Several 
other key amino acids e.g. V93 (which lines one side of the pocket), PI 15 and Fl 16 (which 
line the other side of the pocket) show a high degree of conservation. 

Example 2 

Use was made of three assays to test the ability of the mutant polymerases to recognise uracil. 
Primer extension reactions (M. A. Greagg et aL, Supra) measure the ability of a polymerase 
to extend a primer through uracil bases in the template strand. As expected both the wild type 
and the mutant enzymes were able to completely copy a control template, lacking uracil 
(figure 4 A). As previously observed (M. A. Greagg et al. 9 Supra),, the wild type enzyme 
stalled polymerisation four bases upstream of template-strand uracil, resulting in a truncated 
product (figure 4B). Y37A, Y37F and P115F behaved in a similar manner to wild type. 
However, three of the mutant enzymes, V93Q, PI 15A and Y7A, produced full-length product 
when uracil was present (figure 4B). With V93Q full-length product predominated; in the 
cases of PI 1 5 A and Y7A both full-length and truncated product were seen. 

Next the ability of the polymerases to bind single stranded DNA containing uracil was 
investigated, using a binding assay based on fluorescence anisotropy (S. L. Reid, D. Parry, H- 
H. Liu, B. A. Connolly, Biochemistry 40, 2484 (2001)). K D was determined by fluorescence 
anisotropy used an oligodeoxynucleotide containing a single uracil and a 
hexachlorofluorescein label at its 5-terminal. The oligodeoxynucleotide used was: 5'-hex- 
GCCCGCGGGAUATCGGCCCTTA-3' (SEQ ID NO. 16) (or a control in which the uracil 
was replaced with thymine). The concentration of the oligodeoxynucleotide was 5 nM in 1 ml 
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of 10 mM Hepes-NaOH, pH7.5, lOOmM NaCl, ImM EDTA. Aliquots of the enzyme were 
added and the anisotropy measured; titration was continued until the anisotropy stopped 
increasing. Data fitting to obtain Kd values is as described by Reid et al {Supra)). K D values 
are summarised in table 1. The wild type enzyme bound the uracil-containing 
oligodeoxynucleotide with a Kd of 8.3 nM, a 17-fold preference over a control strand lacking 
this base. Three of the mutants, Y7A, V93Q and P115A bound to the uracil-containing 
oligodeoxynucleotide less well than the wild type (Kd values of 25.7, 144.5 and 84 nM 
respectively, table 1). These mutants correspond exactly to those able to read through a 
uracil-containing template; furthermore the diminution in uracil binding corresponds with 
read through capability. In both assays loss of uracil recognition is: V93Q > P115A >Y7A. 
These three mutants also show a reduced preference for uracil-containing DNA over the 
control sequence; with P115A and V93Q the preference is virtually abolished. The mutants 
Y7A, Y37A and PI 15F bind uracil-containing DNA with essentially the same affinity as the 
wild type (table 1). In some cases, Y37A and Y37F, the preference for the uracil-containing 
oligodeoxynucleotide is reduced, but this arises solely from tighter binding of the control. 
Table 1 also gives the specific activity of the mutant Pfu-Pols relative to the wild type. In 
general there are only small decreases in activity with even the mutant with the lowest 
activity (Y37A) retaining 38 % of the wild type activity. 

The DNA polymerase activity assay (Richardson, C. C. (1966) in Procedures in 
Nucleic Acids Research, G. L. Cantoni, D. R. Davies, Eds, (Harper and Row, New York, 
1966), pp. 263-27) used 50 ^1 samples containing 20 mM Tris-HCl, pH 8.8, lOmM KC1, 
lOmM (NH 4 ) 2 S0 4 , 2 mM MgS0 4 , 0.1% Triton XI 00, 100|ag/ml BSA, 200^M each dNTPs, 
0.2mg/ml activated calf thymus DNA (AP Biotech), l^iCi 3000Ci/mmol [a- 32 P] dATP. Pfu- 
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Pol (the amount varied depending on the activity of the enzyme) was added and a 1 0 minute 
incubation at 72°C was used (reactions were linear over this time). After this period the 
amount of radioactivity incorporated into acid-precipitable material was determined by 
scintillation counting. 1 unit of enzyme is defined as amount of enzyme that incorporates 10 
nmol of dATP into acid-precipitable material in 30 min at 72°C. 



Table 1. Pfu-Pol variants: specific activity and ability to bind to uracil containing DNA. 



Pfu-Pol 
variant 


specific activity 
(units/mg) 


% activity 
(relative to 
wild type) 


K D (nM) 
(uracil) 


K D (nM) 
(control) 


Preference 
for uracil 


wild-type 


3556 


100 


8± 1 


140 ± 10 


18 


Y7A 


2314 


65 


26 ± 1 


255 ± 20 


10 


Y37A 


1352 


38 


9±2 


57 ±7 


6 


Y37F 


2169 


61 


10±4 


120 ±13 


12 


V93Q 


1643 


46 


144 ±7 


277 ± 22 


2 


P115F 


2541 


71 


7± 1 


148 ± 10 


21 


P115A 


1637 


46 


84 ±4 


111 ±5 


1.3 



The specific activities of the Pfu-Pol variants was determined using the incorporation 
of [oc- 32 P]-dATP into acid-precipitable calf-thymus DNA as described (see above). Values 
are accurate ±15%. Binding constants were determined by fluorescence anisotropy using 
hex-GCCCGCGGGAUATCGGCCCTTA (SEQ ID NO. 16) (uracil) or an analogous 
oligodeoxynucleotide in which the uracil was replaced with thymine (control) (S. L. Reid, et 
al\ supra and J. Wang et al, (Supra)). Each value was determined three times and the average 
± one standard deviation is given. The preference for uracil is the ratio K D (uracu/Ko (control)- 
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Finally PCR was performed +/- dUTP the ability of Pfu-Pol to carry out PCR was 
evaluated by amplifying a - 2 kbase fragment (between the T7 promoter and the Hindlll site) 
of p£T17-b(Pfu-Pol) [S. J. Evans et al, Nucl Acids. Res. 28, 1059 (2000)]. Conditions: 
100|al volume, 20 mM Tris-HCl ph 8.8, lOmMKCl, lOmM (NH 4 ) 2 S0 4 , 0.1% Triton XI 00, 
100 |Lig/ml BSA, 250 \iM each dNTP (one set of reactions contained dTTP, the other dUTP), 
2.5 units DNA polymerase overlayed with 40 |il of mineral oil. 5 ng of pET17-b(Pfu-Pol) 
was used and the concentrations of the forward and reverse primers were both 0.3 jiM. Each 
reaction was 30 cycles of 1 min at 95°C, 2 min at 52°C and 4.5 min at 72°C. PCR with Taq- 
Pol was identical to Pfu-Pol save 10 mM Tris-HCl pH 8.8, 50 mM KC1, 0.08% NP-40, 1.5 
mM MgCh was used. Analysis used ethidium bromide-stained agarose gels. 

Pfu-Pol (wild type and mutants) and Taq-Pol, were able to perform PCR with the four 
normal dNTPs (figure 4C). When dUTP was used in place of dTTP, PCR with Taq-Pol was 
unaffected; however, wild-type Pfu-Pol gave no product (figure 4D). The three mutants that 
showed diminished uracil recognition in read-through and binding assays, Y7A, P115A and 
V93Q, gave a PCR product (figure 4D). The amount of PCR product produced was V93Q > 
P115A >Y7A, again matching the order found for loss of uracil recognition. dUTP 
(concentration 250 \xM) completely replaces dTTP in these reactions. The wild type enzyme 
is completely inhibited when 0.02 \xM dUTP is used to spike PCR reactions containing the 
four normal dNTPs (H. H. Hogrefe, et al (Supra)); clearly showing these three mutants are 
very disabled in uracil recognition. The Pfu-Pol mutants, Y37A, Y37F and P115F did not 
give a PCR product with dUTP. 
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Figure 5 shows the results of PCR with Thermus aquaticus polymerase (Taq-Pol), the 
wild type and two mutants (V93Q and V93R) of Pyrococcus furiosus polymerase (Pfu-Pol). 
PCR amplification was performed under two distinct sets of conditions, i.e. in the presence of 
TTP and in the presence of dUTP. 

As expected, all four polymerases, Taq-Pol, the wild type of Pfu-pol and the two 
mutations of Pfu-pol (V93Q, V93R), were able to successfully mediate amplification of the 
DNA sample in the presence of TTP. Clearly visible bands of 1064 bp fragments in lanes 1 to 
4 illustrate this successful amplification. The use of dUTP resulted in the expected amount of 
amplification when used together with Taq-Pol as illustrated in lane 5. Moreover, lane 6 
shows no amplified DNA sample and thereby confirmed that dUTP induces blockage of Pfu- 
Pol mediated amplification. Most importantly, however, lanes 7 and 8 show bands 
corresponding to 1064 bp fragments and thus confirm that both Pfu polymerase mutants are 
capable of amplification. This proves that, unlike wild type Pfu-Pol, the V93Q and V93R 
mutants of Pfu-Pol are not affected by dUTP-induced blockage. 

Overall, Figure 5 confirms the usefulness of the mutants according to the invention in 
PCR utilizing dUTP. 



DT04 fS PCT/PTO 1 3 OCT 2004 



MAILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPYIYALLRDDSKIEEVKKITGERHGK 
IVRIVDVEK 

VEKKFLGKPITWKLYLEHPQDVPTIREKVREHPAVVDIFEYDIPFAKRYLIDKGLIPMEGE 
EELKILAF 

DIETLYHEGEEFGKGPIIMISYADENEAKVITWKNIDLPYVEVVSSEREMIKRFLRIIREKD 
PDIIVTYN 

GD S FD F P YLAKRAE KLG I KLT I GRDGS E PKMQR I GDMTAVE VKGR I HFDL YHV I TRT I NL PT 
YTLEAVYE 

AIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQPLW 
DVSRSSTG 

NLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWENIVYLDFRALYPS 
IIITHNVS 

PDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLEERQKIKTKMKETQDPIEKILLDY 
RQKAIKLL 

ANS F YG Y YGYAKARWYC KE CAE S VTAWGRK Y I ELVWKELE E KFGFKVL Y I DTDGL YAT I PGG 
ESEEIKKK 

ALEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKE 
TQARVLET 

ILKHGDVEEAVRIVKEVIQKLANYEIPPEKLAIYEQITRPLHEYKAIGPHVAVAKKLAAKGV 
KIKPGMVI 

GY I VLRGDGP I SNRAI LAE E YD PKKHKYDAE Y Y I ENQVL PAVLR I LEG FG YRKEDLRYQKTR 

QVGLTSWL 

NIKKS 

(SEQ ID NO.l) 



type is: 

MILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPYIYALLRDDSKIEEVKKITGERHGKI 
VRIVDVEK 

VEKKFLGKP I TVWKLYLEHPQDVPTI RE KVREHPAWD I FE YD I PFAKRYL I DKGL I PMEGE 
EELKILAF 

DIETLYHEGEEFGKGPIIMISYADENEAKVITWKNIDLPYVEWSSEREMIKRFLRI IREKD 
PDIIVTYN 

GD S FD F P YLAKRAE KLG I KLT I GRDGS E PKMQR I GDMTAVE VKGR I HFDL YHV I TRT I NL PT 
YTLEAVYE 

AIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQPLW 
DVSRSSTG 

NLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWENIVYLDFRALYPS 
IIITHNVS 

PDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLEERQKIKTKMKETQDPIEKILLDY 
RQKAIKLL 

ANS F YGYYGYAKAR WYCKE CAE SVTAWGRKY I ELVWKELE E KFGFKVL Y I DTDGLYAT I PGG 
ESEEIKKK 

ALEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKE 
TQARVLET 

ILKHGDVEEAVRIVKEVIQKLANYEIPPEKLAIYEQITRPLHEYKAIGPHVAVAKKLAAKGV 
KIKPGMVI 

GY I VLRGDGP I SNRAI LAE E YD PKKHKYDAE YY I ENQVL PAVLR I LEGFGYRKEDLRYQKTR 

QVGLTSWL 

NIKKS 

(SEQ ID NO. 2) 



(a) Pfu pol Y7A (Y8A) 



MAILDVDAITEEGKPVIRLFKKENGKFKIEHDRTFRPYIYALLRDDSKIEEVKKITGERHGK 
IVRIVDVEK 

VEKKFLGKP I TVWKLYLEH PQDVPT I REKVREHPAWD I FE YD I PFAKRYL I DKGL I PMEGE 
EELKILAF 

D I ETLYHEGEE FGKGP 1 1 M I S YADENE AKVI TWKN I DL PYVEWS S EREM I KRFLR 1 1 RE KD 
PDIIVTYN 

GDS FDF P YLAKRAE KLG I KLT I GRDGS E PKMQR I GDMTAVE VKGR I H FDL YHV I TRT I NL PT 
YTLEAVYE 

AIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQPLW 
DVSRSSTG 

NLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWENIVYLDFRALYPS 
IIITHNVS 

PDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLEERQKIKTKMKETQDPIEKILLDY 
RQKAIKLL 

ANS F YGY YG YAKARWYCKE CAE S VTAWGRKY I ELVWKELE E KFGF KVL Y I DTDGL YAT I PGG 
ESEEIKKK 

ALEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKE 
TQARVLET 

ILKHGDVEEAVRIVKEVIQKLANYEIPPEKl^IYEQITRPLHEYKAIGPHVAVAKKLAAKGV 
KIKPGMVI 

GYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVLPAVLRILEGFGYRKEDLRYQKTR 

QVGLTSWL 

NIKKS 

(SEQ ID NO. 3) 



(b) Pfu pol Y37A (Y38A) 

MAI LDVDYI TEEGKPVI RLFKKENGKFKI EHDRTFRPAI YALLRDDS KI EE VKKI TGERHGK 
IVRIVDVEK 

VE KKFLGKP I TVWKLYLEH PQDVPT I REKVREHPAWD I FEYD I PFAKRYL I DKGL I PMEGE 
EELKILAF 

DIETLYHEGEEFGKGPIIMISYADENEAKVITWKNIDLPYVEWSSEREMIKRFLRIIREKD 
PDIIVTYN 

GDS FDF P YLAKRAE KLG I KLT I GRDGS E PKMQR I GDMTAVE VKGR I HFDL YHV I TRT I NL PT 
YTLEAVYE 

AI FGKPKEKVYADE I AKAWESGENLERVAKYSMEDAKATYELGKEFLPME IQLSRLVGQPLW 
DVSRSSTG 

NLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWENIVYLDFRALYPS 
IIITHNVS 

PDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLEERQKIKTKMKETQDPIEKILLDY 
RQKAIKLL 

ANS F YGYYGYAKARWYCKE CAES VTAWGRKY I ELVWKELEEKFGFKVLY I DTDGL YAT I PGG 
ESEEIKKK 

ALEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKE 
TQARVLET 

ILKHGDVEEAVRIVKEVIQKLANYEIPPEKLAIYEQITRPLHEYKAIGPHVAVAKKLAAKGV 
KIKPGMVI 

GYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVLPAVLRILEGFGYRKEDLRYQKTR 

QVGLTSWL 

NIKKS 

(SEQ ID NO. 4) 



(c) Pfu pol V93Q (V94Q) 



MAILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPYIYALLRDDSKIEEVKKITGERHGK 
IVRIVDVEK 

VEKKFLGKPITWKLYLEHPQDQPTIREKVREHPAWDIFEYDIPFAKRYLIDKGLIPMEGE 
EELKILAF 

DIETLYHEGEEFGKGPI IMISYADENEAKVITWKNIDLPYVEWSSEREMIKRFLRI IREKD 
PDIIVTYN 

GDSFDFPYLAKRAEKLGIKLTIGRDGSEPKMQRIGDMTAVEVKGRIHFDLYHVITRTINLPT 
YTLEAVYE 

AIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQPLW 
DVSRSSTG 

NLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWENIVYLDFRALYPS 
IIITHNVS 

PDTLNLEGCKNYDI APQVGHKFCKDI PGFI PSLLGHLLEERQKI KTKMKETQDPI EKI LLDY 
RQKAIKLL 

ANS FYGYYGYAKARWYCKECAE S VTAWGRKYI ELVWKELEEKFGFKVLYI DTDGLYAT I PGG 
ESEEIKKK 

ALEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKE 
TQARVLET 

I LKHGD VE E AVR I VKE V I QKLANYE I P PE KLA I YEQ I TRPLHE YKA I G PHVAVAKKIjAAKGV 
KIKPGMVI 

GYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVLPAVLRILEGFGYRKEDLRYQKTR 

QVGLTSWL 

NIKKS 

(SEQ ID NO. 5) 



(d) Pfu pol V93R (V94R) 

MAILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPYIYALLRDDSKIEEVKKITGERHGK 
IVRIVDVEK 

VE KKFLGKP I TVWKLYLEHPQDRPT I REKVREHPAWD I FE YD I P FAKRYL IDKGL I PMEGE 
EELKILAF 

DIETLYHEGEEFGKGPI IMISYADENEAKVITWKNIDLPYVEWSSEREMIKRFLRI IREKD 
PDIIVTYN 

GDSFDFPYLAKRAEKLGIKLTIGRDGSEPKMQRIGDMTAVEVKGRIHFDLYHVITRTINLPT 
YTLEAVYE 

AIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQPLW 
DVSRSSTG 

NLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWENIVYLDFRALYPS 
IIITHNVS 

PDTLNLEGCKNYD I APQVGHKFCKD I PGFI PSLLGHLLEERQKI KTKMKETQD P I E KI LLD Y 
RQKAIKLL 

ANS FYGYYGYAKARWYCKE CAES VTAWGRKYI ELVWKELEEKFGFKVLYI DTDGLYAT I PGG 
ESEEIKKK 

ALEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKE 
TQARVLET 

ILKHGDVEEAVRIVKEVIQKLANYEIPPEKLAIYEQITRPLHEYKAIGPHVAVAKKLAAKGV 
KIKPGMVI 

GYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVLPAVLRILEGFGYRKEDLRYQKTR 

QVGLTSWL 

NIKKS 

(SEQ ID NO. 6) 



(e) Pfu pol I114R (I115R) 

MAILDVDAITEEGKPVIRLFKKENGKFKIEHDRTFRPYIYALLRDDSKIEEVKKITGERHGK 
IVRIVDVEK 

VEKKFLGKPITVWKLYLEHPQDVPTIREKVREHPAWDIFEYDRPFAKRYLIDKGLIPMEGE 
EELKILAF 

DIETLYHEGEEFGKGPIIMISYADENEAKVTTWKNIDLPYVEVVSSEREMIKRFLRI IREKD 
PDIIVTYN 

GDSFDFPYLAKRAEKLGIKLTIGRDGSEPKMQRIGDMTAVEVKGRIHFDLYHVITRTINLPT 
YTLEAVYE 

AIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQPLW 
DVSRSSTG 

NLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWENIVYLDFRALYPS 
IIITHNVS 

PDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLEERQKIKTKMKETQDPIEKILLDY 
RQKAIKLL 

ANS FYGYYGYAKARWYCKECAE S VTAWGRKY I ELVWKELE E KFGFKVL Y I DTDGLYAT I PGG 
ESEEIKKK 

ALEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKE 
TQARVLET 

ILKHGDVEEAVRIVKEVIQKLANYEIPPEKLAIYEQITRPLHEYKAIGPHVAVAKKLAAKGV 
KIKPGMVI 

GY I VLRGDGP I SNRAI LAEE YDPKKHKYDAE YY I ENQVLPAVLRI LEGFGYRKEDLRYQKTR 

QVGLTSWL 

NIKKS 

(SEQ ID NO. 7) 



(f) Pfu pol I114Q (I115Q) 

MAILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPAIYALLRDDSKIEEVKKITGERHGK 
IVRIVDVEK 

VEKKFLGKPITVWKLYLEHPQDVPTIREKVREHPAWDI FEYDQPFAKRYLIDKGLI PMEGE 
EELKILAF 

DIETLYHEGEEFGKGPIIMISYADENEAKVITWKNIDLPYVEWSSEREMIKRFLRIIREKD 
PDIIVTYN 

GDS FD F P YLAKRAE KLG I KLT I GRDGS E P KMQR I GDMTAVE VKGR I HFDL YHV I TRT I NL PT 
YTLEAVYE 

AIFGKPKEKVYADEIAKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQPLW 
DVSRSSTG 

NLVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWENIVYLDFRALYPS 
IIITHNVS 

PDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLEERQKIKTKMKETQDPIEKILLDY 
RQKAIKLL 

ANS FYGYYGYAKARWYCKECAE S VTAWGRKY I ELVWKELE E KFGFKVL Y I DTDGLYAT I PGG 
ESEEIKKK 

ALEFVKYINSKLPGLLELEYEGFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKE 
TQARVLET 

ILKHGDVEEAVRIVKEVIQKLANYEIPPEKLAIYEQITRPLHEYKAIGPHVAVAKKLAAKGV 
KIKPGMVI 

GY I VLRGDGP I SNRAI LAEEYDPKKHKYDAEYY I ENQVLPAVLRI LEGFGYRKEDLRYQKTR 

QVGLTSWL 

NIKKS 

(SEQ ID NO. 8) 



(a) Pfu pol Y7A (Y8A) 



ATGGCTATCCTGGACGTTGACGCCATCACCGAAGAAGGTAAGCCGGTTATCCGTCTGTTCAA 
AAAAGAAAACGGTAAATTCAAAATCGAACACGACCGTACCTTCCGTCCGTACATCTACGCTC 
rGCTGCGTGACGACTCTAAAATCGAAGAAGTTAAAAAAATCACCGGTGAACGTCATGGAAAG 
ATTGTGAGAATTGTTGATGTAGAGAAGGTTGAGAAAAAGTTTCTCGGCAAGCCTATTACCGT 
GTGG AAACTTT ATTTGG AAC AT C C C C AAGATGTT CC C ACT ATTAG AGAAAAAGTTAG AGAAC 
AT C CAGC AGTTGTGG AC AT CT T CGAAT ACGAT ATTC C ATTTGC AAAGAG AT AC CT C AT CGAC 
AAAGGC CT AAT AC CAATGG AGGGGGAAGAAGAGCT AAAGATT CTTG C CTT CG AT ATAG AAAC 
C CT CTAT C ACGAAGG AGAAGAGTTTGGAAAAGGC CC AATTAT AATG ATTAGTT ATGC AG ATG 
AAAATGAAGCAAAGGTGATTACTTGGAAAAACATAGATCTTCCATACGTTGAGGTTGTATCA 
AGCGAGAGAG AG ATGAT AAAGAG ATTT CT CAGG ATT ATCAGGGAGAAGGAT C CTGAC ATT AT 
AGTTACTTATAATGGAGACTCATTCGACTTCCCATATTTAGCGAAAAGGGCAGAAAAACTTG 
GGATT AAATT AACC ATTGGAAGAGATGG AAG CG AGC C C AAGATGC AGAGAAT AGGCGAT ATG 
ACGGCTGTAGAAGTCAAGGGAAGAATACATTTCGACTTGTATCATGTAATAACAAGGACAAT 
AAAT CT CCCAAC AT ACAC ACT AGAGGCTGT ATATGAAGC AATTTTTGGAAAGC CAAAGGAGA 
AGGT AT ACGCCG ACGAGAT AG C AAAAGCCTGGGAAAGTGGAGAG AAC CTTGAGAGAGTTGC C 
AAATACTCGATGGAAGATGCAAAGGCAACTTATGAACTCGGGAAAGAATTCCTTCCAATGGA 
AATTCAGCTTTCAAGATTAGTTGGACAACCTTTATGGGATGTTTCAAGGTCAAGCACAGGGA 
ACCTTGTAGAGTGGTTCTTACTTAGGAAAGCCTACGAAAGAAACGAAGTAGCTCCAAACAAG 
C C AAGTGAAGAGGAGT ATCAAAGAAGGCTC AGGGAG AGCTAC ACAGGTGGATT CGTT AAAGA 
GCCAGAAAAGGGGTTGTGGGAAAACATAGTATACCTAGATTTTAGAGCCCTATATCCCTCGA 
TTATAATTACCCACAATGTTTCTCCCGATACTCTAAATCTTGAGGGATGCAAGAACTATGAT 
ATCGCTCCTCAAGTAGGCCACAAGTTCTGCAAGGACATCCCTGGTTTTATACCAAGTCTCTT 
GGGAC ATTTGTT AGAGGAAAGACAAAAGATT AAGAC AAAAATGAAGG AAACT CAAGAT C CTA 
TAGAAAAAATACTCCTTGACTATAGACAAAAAGCGATAAAACTCTTAGCAAATTCTTTCTAC 
GGATATTATGGCTATGCAAAAGCAAGATGGTACTGTAAGGAGTGTGCTGAGAGCGTTACTGC 
CTGGGGAAGAAAGT AC AT CGAGTT AGTATGG AAGGAG CT CGAAG AAAAGTTTGGATTTAAAG 
T C CT CT ACATTGAC ACTG ATGGT CT CTATGCAACT AT C C C AGGAGGAGAAAGTGAGGAAATA 
AAGAAAAAGGCTCTAGAATTTGTAAAATACATAAATTCAAAGCTCCCTGGACTGCTAGAGCT 
TGAAT ATGAAGGGTTTT AT AAGAGGGGATT CTT CGTTACGAAGAAGAGGT ATGC AGT AAT AG 
ATGAAGAAGGAAAAGTCATTACTCGTGGTTTAGAGATAGTTAGGAGAGATTGGAGTGAAATT 
GCAAAAGAAACTCAAGCTAGAGTTTTGGAGACAATACTAAAACACGGAGATGTTGAAGAAGC 
TGTGAGAAT AGT AAAAGAAGT AAT AC AAAAGCTTGC C AATT ATGAAATT C CACC AGAG AAGC 
TCGC AAT AT ATGAGCAGAT AAC AAGAC C ATT AC ATGAGT AT AAGGCGAT AGGTCCT C ACGTA 
GCTGTTGCAAAGAAACTAGCTGCTAAAGGAGTTAAAATAAAGCCAGGAATGGTAATTGGATA 
CATAGT ACTTAG AGGCGATGGT C C AATT AG C AAT AGGGC AATTCT AGCTGAGGAAT ACGATC 
C C AAAAAGCAC AAGT ATG ACG C AGAAT ATT ACATTGAGAAC C AGGTT CTT C C AGCGGTACTT 
AGGAT ATTGGAGGGATTTGGAT AC AGAAAGG AAGAC CT C AG AT AC C AAAAGAC AAG ACAAGT 
CGGCCTAACTTCCTGGCTTAACATTAAAAAATCC 

(SEQ ID NO. 9) 

(b) Pfu pol V93Q (V94Q) 

ATGGCTATCCTGGACGTTGACTACATCACCGAAGAAGGTAAGCCGGTTATCCGTCTGTTCAA 
AAAAGAAAACGGTAAATTCAAAATCGAACACGACCGTACCTTCCGTCCGTACATCTACGCTC 
TGCTGCGTGACGACTCTAAAATCGAAGAAGTTAAAAAAATCACCGGTGAACGTCATGGAAAG 
ATTGTGAGAATTGTTGATGTAGAGAAGGTTGAG AAAAAGTTT CT CGGC AAGC CT ATT AC CGT 
GTGGAAACTTT ATTTGG AAC AT C C CCAAGAT C AGC C C ACT ATTAGAGAAAAAGTT AGAGAAC 
AT C CAGC AGTTGTGGACAT CTT CGAAT ACG AT ATT C C ATTTGC AAAGAGAT AC CT CAT CGAC 
AAAGGC CT AAT AC CAATGG AGGGGGAAGAAG AGCTAAAGATT CTTGC CTT CGAT AT AGAAAC 
C CT CT ATCACG AAGGAGAAGAGTTTGG AAAAGGC C C AATT AT AATGATT AGTT ATG C AGATG 
AAAATG AAGC AAAGGTGATTACTTGGAAAAACAT AGAT CTT C CAT ACGTTGAGGTTGTATCA 



AGCGAG AG AG AG ATG AT AAAG AGATTTCTC AGG ATT AT C AGGG AGAAGG AT C CTGAC ATT AT 
AGTTACTTATAATGGAGACTCATTCGACTTCCCATATTTAGCGAAAAGGGCAGAAAAACTTG 
GGATT AAATT AAC C ATTGGAAGAGATGGAAG CG AGC C C AAGATGC AG AGAAT AGGCG AT ATG 
ACGGCTGTAGAAGTCAAGGGAAGAATACATTTCGACTTGTATCATGTAATAACAAGGACAAT 
AAATCTCCCAACATACACACTAGAGGCTGTATATGAAGCAATTTTTGGAAAGCCAAAGGAGA 
AGGTAT ACGC CGACGAG ATAGC AAAAGC CTGGGAAAGTGGAG AG AAC CTTGAGAG AGTTGC C 
AAAT ACT CGATGGAAGATGCAAAGGCAACTT ATG AACTCGGG AAAG AATTCCTTCCAATGGA 
AATTCAGCTTTCAAGATTAGTTGGACAACCTTTATGGGATGTTTCAAGGTCAAGCACAGGGA 
ACCTTGTAGAGTGGTTCTTACTTAGGAAAGCCTACGAAAGAAACGAAGTAGCTCCAAACAAG 
CC AAGTGAAGAGGAGT AT C AAAG AAGG CT C AGGGAGAG CT AC AC AGGTGG ATT CGTT AAAG A 
GC C AGAAAAGGGGTTGTGGG AAAACAT AGT AT AC CT AGATTTT AG AGC C CTATAT C CCTCGA 
TT ATAATT AC C C ACAATGTTT CT C C CGAT ACT CT AAATCTTGAGGG ATGCAAGAACT ATGAT 
ATCGCTCCTCAAGTAGGCCACAAGTTCTGCAAGGACATCCCTGGTTTTATACCAAGTCTCTT 
GGGACATTTGTTAGAGGAAAGACAAAAGATTAAGACAAAAATGAAGGAAACTCAAGATCCTA 
TAGAAAAAATACTCCTTGACTATAGACAAAAAGCGATAAAACTCTTAGCAAATTCTTTCTAC 
GGAT ATTATGG CTATGC AAAAGC AAGATGGTACTGTAAGGAGTGTGCTGAG AG CGTT ACTGC 
CTGGGGAAGAAAGTAC ATCG AGTT AGTATGGAAGGAG CT CG AAGAAAAGTTTGGATTT AAAG 
TCCTCTACATTGACACTGATGGTCTCTATGCAACTATCCCAGGAGGAGAAAGTGAGGAAATA 
AAGAAAAAGGCT CTAGAATTTGTAAAAT ACAT AAATT CAAAGCTCCCTGGACTGCTAGAGCT 
TGAAT ATGAAGGGTTTTATAAGAGGGGATT CTT CGTTACGAAG AAG AGGTATGCAGT AAT AG 
ATGAAGAAGGAAAAGTCATTACTCGTGGTTTAGAGATAGTTAGGAGAGATTGGAGTGAAATT 
GCAAAAGAAACTCAAGCTAGAGTTTTGGAGACAATACTAAAACACGGAGATGTTGAAGAAGC 
TGTGAG AAT AGT AAAAGAAGT AATAC AAAAGCTTGC CAATT ATGAAATT CCAC C AGAGAAGC 
T CGC AATATATG AGCAGATAAC AAGACCATT ACATGAGTAT AAGGCGAT AGGT C CT CACGT A 
GCTGTTGCAAAGAAACTAGCTGCTAAAGGAGTTAAAATAAAGCCAGGAATGGTAATTGGATA 
CATAGTACTTAGAGGCGATGGTCCAATTAGCAATAGGGCAATTCTAGCTGAGGAATACGATC 
CC AAAAAGC AC AAGT ATGACGCAGAAT ATTACATTG AGAAC CAGGTT CTT C C AGCGGT ACTT 
AGGATATTGGAGGGATTTGGATACAGAAAGGAAGACCTCAGATACCAAAAGACAAGACAAGT 
CGGCCTAACTTCCTGGCTTAACATTAAAAAATCC 

(SEQ ID NO. 10) 



(c) Pfu Pol P115A (P116A) 

ATGGCTATCCTGGACGTTGACTACATCACCGAAGAAGGTAAGCCGGTTATCCGTCTGTTCAA 
AAAAGAAAACGGTAAATTCAAAATCGAACACGACCGTACCTTCCGTCCGTACATCTACGCTC 
TGCTGCGTGACGACTCTAAAATCGAAGAAGTTAAAAAAATCACCGGTGAACGTCATGGAAAG 
ATTGTGAGAATTGTTGATGTAGAGAAGGTTGAGAAAAAGTTTCTCGGCAAGCCTATTACCGT 
GTGGAAACTTT ATTTGGAACATCCC CAAGATGTT C C CACT ATT AGAGAAAAAGTTAGAGAAC 
AT C CAGC AGTTGTGGACATCTT CGAAT ACGATATTTTTGC AAAGAGAT ACCT C AT CG AC AAA 
GGCCTAATACCAATGGAGGGGGAAGAAGAGCTAAAGATTCTTGCCTTCGATATAGAAACCCT 
CTATCACGAAGGAGAAGAGTTTGGAAAAGGCCCAATTATAATGATTAGTTATGCAGATGAAA 
ATGAAGCAAAGGTGATTACTTGGAAAAACATAGATCTTCCATACGTTGAGGTTGTATCAAGC 
GAG AGAGAGATG ATAAAGAGATTT CT CAGGATT AT C AGGGAGAAGGATC CTGACATT AT AGT 
TACTTATAATGGAGACTCATTCGACTTCCCATATTTAGCGAAAAGGGCAGAAAAACTTGGGA 
TTAAATTAACCATTGGAAGAGATGGAAGCGAGCCCAAGATGCAGAGAATAGGCGATATGACG 
GCTGTAGAAGT C AAGGGAAGAAT AC ATTTCG ACTTGTAT C ATGTAATAACAAGGAC AAT AAA 
TCT C CCAAC AT AC AC ACTAGAGGCTGT ATATGAAGCAATTTTTGGAAAGCC AAAGGAG AAGG 
TAT ACGC CGACGAGAT AGC AAAAGC CTGGGAAAGTGGAGAGAAC CTTGAGAGAGTTGCCAAA 
TACTCGATGGAAGATGCAAAGGCAACTTATGAACTCGGGAAAGAATTCCTTCCAATGGAAAT 
TCAGCTTTCAAGATTAGTTGGACAACCTTTATGGGATGTTTCAAGGTCAAGCACAGGGAACC 
TTGT AGAGTGGTT CTTACTT AGGAAAGC CTACGAAAGAAACGAAGTAGCTC C AAAC AAGC C A 
AGTGAAG AGGAGT AT CAAAGAAGGCT CAGGGAGAGCT AC AC AGGTGG ATT CGTTAAAGAG C C 
AG AAAAGGGGTTGTGGG AAAACAT AGT AT AC CT AG ATTTT AGAGC C CTATAT C C CT CGATT A 
TAATT AC CCACAATGTTTCTCC CGAT ACTCTAAATCTTGAGGGATGCAAGAACT ATGAT ATC 



GCTCCTCAAGTAGGCCACAAGTTCTGCAAGGACATCCCTGGTTTTATACCAAGTCTCTTGGG 
AC ATTTGTT AGAGG AAAGAC AAAAG ATT AAG ACAAAAATGAAGG AAACT CAAG AT C CT ATAG 
AAAAAATACTCCTTGACTATAGACAAAAAGCGATAAAACTCTTAGCAAATTCTTTCTACGGA 
TATTATGGCTATGCAAAAGCAAGATGGTACTGTAAGGAGTGTGCTGAGAGCGTTACTGCCTG 
GGG AAG AAAGT AC AT CGAGTT AGT ATGGAAGG AG CT CGAAG AAAAGTTTGGATTT AAAGT C C 
TCTACATTGACACTGATGGTCTCTATGCAACTATCCCAGGAGGAGAAAGTGAGGAAATAAAG 
AAAAAGGCTCTAGAATTTGTAAAATACATAAATTCAAAGCTCCCTGGACTGCTAGAGCTTGA 
ATATGAAGGGTTTTATAAGAGGGGATTCTTCGTTACGAAGAAGAGGTATGCAGTAATAGATG 
AAGAAGGAAAAGTCATTACTCGTGGTTTAGAGATAGTTAGGAGAGATTGGAGTGAAATTGCA 
AAAGAAACTCAAGCTAGAGTTTTGGAGACAATACTAAAACACGGAGATGTTGAAGAAGCTGT 
GAGAAT AGT AAAAG AAGTAATACAAAAG CTTGC C AATT ATGAAATT CCACCAGAG AAGCT CG 
C AAT AT ATGAGC AG AT AAC AAGAC CATT ACATGAGT AT AAGGCGATAGGTC CTC ACGT AGCT 
GTTGCAAAGAAACTAGCTGCTAAAGGAGTTAAAATAAAGCCAGGAATGGTAATTGGATACAT 
AGT ACTTAGAGG CG ATGGT C C AATTAGC AAT AGGGC AATT CT AGCTGAGGAAT ACGATCC C A 
AAAAGCACAAGTATGACGCAGAATATTACATTGAGAACCAGGTTCTTCCAGCGGTACTTAGG 
AT ATTGGAGGGATTTGGAT ACAGAAAGGAAG AC CT C AGAT AC C AAAAGAC AAGAC AAGT CGG 
CCTAACTTCCTGGCTTAACATTAAAAAATCC 

(SEQ ID NO. 11) 



Seq id no. 12 

MILDTDYITEDGKPVIRIFKKENGEFKIDYDRNFEPYIYALLKDDSAIEDVKKITAERHGTT 
VRWRAEKVKKKFLGRP I E VWKLYFTHPQDVPAI RDKI KEH PAWD I YE YD I PFAKRYL I DK 
GLIPME 

Seq id no . 13 

MKEFYLTVEQIGDS I FERYIDSNGRERTREVEYKPSLFAHCPESQATKYFDI YGKPCTRKLF 
ANMRDASQWI KRMED I GLEALGMDDFKLAYLSDTYNYE I KY 

5 , -GGGGATCCTCTAGAGTCGACCTGC-3 f (SEQ ID NO. 14)) 



(5 ; - GGAGACAAGCTTG ( U/ T ) ATGCCTGCAGGTCGACTCTAGCGGCTAAAA - 3 ' ) FORMER 
SEQ ID NO 15; 

SEQ ID NO. 15 

GGAGACAAGCTTGUATGCCTGCAGGTCGACTCTAGCGGCTAAAA 
SEQ ID NO. 17 

GGAGACAAGCTTGTATGCCTGCAGGTCGACTCTAGCGGCTAAAA 



(SEQ ID NO. 16) 

5' -hex-GCCCGCGGGAUATCGGCCCTTA- 3 1 



Pfu (SEQ ID NO 18) 

M I LDVDY I TE EGKPVI RLFKKENGKFKI EHDRTFRP Y I YALLRDDS KI EEVKKI TGERHGKI 
VRIVDVEKVEKKFLGKPITVWKLYLEHPQDVPTIREKVREHPAWDIFEYDIPFAKRYLIDK 
GLIPMEG 



Tgo 



MI LDTD YI TEDGKPVI R I FKKENGEFKI D YDRNFE P Y I YALLKDDS AI EDVKK I TAERHGTT 
VRWRAE KVKKKFLGR P I E VWKLYFTHPQDVP AI RDK I KEH PA WD I YE YD I P FAKRYL I DK 
GLIPMEG 

PKOD 

MI LDTDYITEDGKPVIRI FKKENGEFKI EYDRTFEPYFYALLKDDSAIEEVKKITAERHGTV 
VTVKRVEKVQKKFLGRPVEVWKLYFTHPQDVPAIRDKIREHPAVIDIYEYDIPEAKRYLIDK 
GLVPMEG 

DTok 

M I LDADY I TEDGKPVI RVFKKEKGE FKI DYDRDFE P Y I YALLKDDS AI EDI KKI TAERHGTT 
VRVTRAERVKKKFLGRPVEVWKLYFTHPQDVPAIRDKIREHPAWDIYEYDIPFAKRYLIDR 
GLIPMEG 

9° N-7 

M I LDTD YI TENGKPVI RVFKKENGEFKI E YDRTFE PYFYALLKDDS AI EDVKKVTAKRHGTV 
VKVKRAE KVQKKE FLGR P I E VWKL YFNH PQDVPAI RDR I RAH PA WD I YE YD I P FAKRYL I D 
KGLIPMEG 

Tli 

MILDTDYITKDGKPIIRIFKKENGEFKIELDPHFQPYIYALLKDDSAIEEIKAIKGERHGKT 
VRVLDAVKVRKKFLGRE VE VWKL I FEH PQDVPAMRGK I REHPAWD I YE YD I P FAKRYL I DK 
GLIPMEG 

Mvo 

MDLDYNSKDLCIDMYYKNCGLKKPEINLQKECEFKPYFYVDTSEPKEIYDYLDGLNQEIDLK 
KLEPEFENNTSLKVQDLITNIEI IEKIVYSDYILNGKDISEVSDFKNKKERKICKVYVKYPN 
HVKI IREYFKEFGKSYEFDI PFLRRYMIDQDI VPSAK 

Pis 

MELKVWPLD I T YAWGS VPE I RI FGI LS SGERWL I DRS FKP YF YVDCAVCE PAALKTALSR 
VAPIDDVQIVERRFLGRSKKFLKVIAKIPEDVRKLREAAMSIPRVSGVYEADIRFYMRYMID 
MGWPCSW 



Afu 

ME RVEGWL I DAD YET I GGKAWRLWCKDDQG I FVAYD YNFD P YF YV I GVDED I LKNAAT S TR 
REVIKLKSFEKAQLKTLGREVEGYIVYAHHPQHVPKLRDYLSQFGDVREADIPFAYRYLIDK 
DLACMDG 

Csy 

TVQDAVEIPPSLLVSATYDSQAGAWLKFYEPESQKIVHWTDNTGHKPYCYTRQPPSELGEL 
EGREDVLGTEQVMRHDLIADKDVPVTKITVADPLAIGGTNSEKSIRNIMDTWESDIKYYENY 
LYDKSLWGRY 



Sac 



WIKEAEDGKVYFLLQVDYDGKKSRAVCKIjYDKEGKKIYIMQDESGHKPYFLTDIDPDKVNKI 

TKVVRDPSFDHLELINKVDPYTGKKIRLTKIVV 

DNGLIPGLI 

Soh 

WIKEAEEGKSYFLLQVDYDGKKSKAICKLYDKETKKIYILYDNTGHKPYFLTDIDPEKVNKI 
PKWRDPSFDHLETVIKIDPYSGNKIKLTKIWKDPLAVRRMRNSVPKAYEAHIKYFNNYIY 
DLGLIPGLP 

Sso 

WLEEAQENKIYFLLQVDYDGKKGKAVCKLFDKETQKIYALYDNTGHKPYFLVDLEPDKVGKI 
PKIRDPSFDHIETVSKIDPYTWNKFKLTKIWRDPLAVRRLRNDVPKAYEAHIKYFNNYMYD 
IGLIPGMP 

Poc 

KPLEARDGVEGFLLQTMYDGERGVAAAKI YDDRNG I VYVYFDRTGYMP YFLTD I P PDKLQEL 
HE WRHKGFDHVE WE KFDLLRWQRRKVTK I WKT PD WRVLRDKVPRAWEAN I KFHHNY I Y 
DYGLVPGMK 

Ape 

VREPWVESVRGYLLDVRYDGSLGKAVLMLYDPSSGSLVKWADRTGHKPYFLTDARPEDLRAA 
GVDVSHDE S FLQ YDL VE KFH P I DRKL VKL YKI WSD PLAVRRLRE KVS S AGFS VWE AD I KYH 
HNY I FDRQL IPGIL 



